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1 Generative modeling for continuous non-linearly embedded visual inference 
A± Cristian Sminchisescu, Allan Jepson 



July 2004 Proceedings of the twenty-first international conference on Machine 
learning ICML '04 

Publisher: ACM Press 

Full text available: ^ pdf(600.99 KB) Additional Information; full citation , abstract , references 

Many difficult visual perception problems, like 3D human motion estimation, can be 
formulated in terms of inference using complex generative models, defined over high- 
dimensional state spaces. Despite progress, optimizing such models is difficult because 
prior knowledge cannot be flexibly integrated in order to reshape an initially designed 
representation space. Nonlinearities, inherent sparsity of high-dimensional training sets, 
and lack of global continuity makes dimensionality reduction chall ... 



2 Introduction to Bayesian learning D 

# Aaron Hertzmann 
August 2004 Proceedings of the conference on SIGGRAPH 2004 course notes GRAPH 
04 

Publisher: ACM Press 

Full text available: ^ pdf(899.54 KB) Additional Information: full citation , abstract 

Sophisticated computer graphics applications require complex models of appearance, 
motion, natural phenomena, and even artistic style. Such models are often difficult or 
impossible to design by hand. Recent research demonstrates that, instead, we can "learn" 
a dynamical and/or appearance model from captured data, and then synthesize realistic 
new data from the model. For example, we can capture the motions of a human actor and 
then generate new motions as they might be performed by that actor. B ... 

3 Bayesian approach to sensor-based context awareness D 
Panu Korpipaa, Miika Koskinen, Johannes Peltola, Satu-Marja MSkeia, Tapio SeppSnen 

July 2003 Personal and Ubiquitous Computing, volume 7 issue 2 

Publisher: Springer-Verlag 

Full text available: * ^ pdf(933.60 KB) Additional Information: full citation , abstract , index terms 

AbstractThe usability of a mobile device and services can be enhanced by context 
awareness. The aim of this experiment was to expand the set of generally recognizable 
constituents of context concerning personal mobile device usage. Naive Bayesian 
networks were applied to classify the contexts of a mobile device user in her normal daily 
activities. The distinguishing feature of this experiment in comparison to earlier context 
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recognition research is the use of a naive Bayes framework, and an exte ... 

Keywords: Audio context, Bayesian networks, Context awareness, Context recognition, 
Mobile computing, Sensor 

Bayesian grammar induction for language modeling Q 
Stanley F. Chen 

June 1995 Proceedings of the 33rd annual meeting on Association for Computational 
Linguistics 

Publisher: Association for Computational Linguistics 

Full text available: P >pdf(654.11 KB) 

|j Additional Information: full citation , abstract , references , citings 

W Publisher Site 

We describe a corpus-based induction algorithm for probabilistic context-free grammars. 
The algorithm employs a greedy heuristic search within a Bayesian framework, and a 
post-pass using the Inside-Outside algorithm. We compare the performance of our 
algorithm to /7-gram models and the Inside-Outside algorithm in three language modeling 
tasks. In two of the tasks, the training data is generated by a probabilistic context-free 
grammar and in both tasks our algorithm outperforms the other te ... 

Text classification: A refinement approach to handling model misfit in text Q 
categorization 

Haoran Wu, Tong Heng Phang, Bing Liu, Xiaoli Li 

July 2002 Proceedings of the eighth ACM SIGKDD international conference on 
Knowledge discovery and data mining 

Publisher: ACM Press 

Full text available: f| odff 170.65 KB) M6iM ,nformation: ^ ation - *StaA references , citings, index 

Text categorization or classification is the automated assigning of text documents to pre- 
defined classes based on their contents. This problem has been studied in information 
retrieval, machine learning and data mining. So far, many effective techniques have been 
proposed. However, most techniques are based on some underlying models and/or 
assumptions. When the data fits the model well, the classification accuracy will be high. 
However, when the data does not fit the model well, the classificat ... 

Keywords: Rocchio algorithm, naive Bayesian classifier, text categorization 



6 A simple approach to building ensembles of Naive Bayesian classifiers for word Q 

sense disambiguation 
Ted Pedersen 

April 2000 Proceedings of the first conference on North American chapter of the 

Association for Computational Linguistics 
Publisher: Morgan Kaufmann Publishers Inc. 

Full text available: "jf ) pdf(677.21 KB) Additional Information: full citation , abstract , references , citings 

This paper presents a corpus-based approach to word sense disambiguation that builds an 
ensemble of Naive Bayesian classifiers, each of which is based on lexical features that 
represent co-occurring words in varying sized windows of context. Despite the simplicity 
of this approach, empirical results disambiguating the widely studied nouns line and 
interest show that such an ensemble achieves accuracy rivaling the best previously 
published results. 
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Bayesian network 
Sangho Park, J. K. Aggarwal 

November 2003 First ACM SIGMM international workshop on Video surveillance 
Publisher: ACM Press 

Full text available: pdf(643.17 KB) Additional Information: full citation , abstract , references , index terms 

Recognizing human interactions is a challenging task due to the multiple body parts of 
interacting persons and the concomitant occlusions. This paper presents a method for the 
recognition of two-person interactions using a hierarchical Bayesian network (BN). The 
poses of simultaneously tracked body parts are estimated at the low level of the BN, and 
the overall body pose is estimated at the high level of the BN. The evolution of the poses 
of the multiple body parts are processed by a dynamic Bay ... 

Keywords: Bayesian network, event recognition, hierarchy, human interaction, motion, 
scene understanding, surveillance 



8 Research track paper: A Bayesian network classifier with inverse tree structure for 
|k voxelwise magnetic resonance image analysis 
* Rong Chen, Edward H. Herskovits 

August 2005 Proceeding of the eleventh ACM SIGKDD international conference on 
Knowledge discovery in data mining KDD '05 

Publisher: ACM Press 

Full text available: pdf(284.43 KB) Additional Information: full citation , abstract , references , index terms 

We propose a Bayesian-network classifier with /nverse-tree structure (BNCIT) for joint 
classification and variable selection. The problem domain of voxelwise magnetic- 
resonance image analysis often involves millions of variables but only dozens of samples. 
Judicious variable selection may render classification tractable, avoid over-fitting, and 
improve classifier performance. BNCIT embeds the variable-selection process within the 
classifier-training process, wh ... 

Keywords: Bayesian network, Markov blanket, classifier, magnetic resonance image 



9 Model Averaging for Prediction with Discrete Bayesian Networks 
Denver Dash, Gregory F. Cooper 

December 2004 The Journal of Machine Learning Research, Volume 5 
Publisher: MIT Press 

Full text available: ^ pdf(267.17 KB) Additional Information: full citation , abstract 

In this paper we consider the problem of performing Bayesian model-averaging over a 
class of discrete Bayesian network structures consistent with a partial ordering and with 
bounded in-degree k. We show that for N nodes this class contains in the worst-case at 
least <img align=middle src=dash04a-omega.jpeg alt="omega eq"> distinct network 
structures, and yet model averaging over these structures can be performed using <img 
align=middle src=dash04a-bigo.jpeg alt="bigo eq"&g ... 

10 Context-specific Bayesian clustering for gene expression data 

#Yoseph Barash, Nir Friedman 
April 2001 Proceedings of the fifth annual international conference on Computational 

biology 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citings , index 



Full text available: TO pdf(233.32 KB) 

^^—^ terms 

The recent growth in genomic data and measurement of genome-wide expression 
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patterns allows to examine gene regulation by transcription factors using computational 
tools. In this work, we present a class of mathematical models that help in understanding 
the connections between transcription factors and functional classes of genes based on 
genetic and genomic data. These models represent the joint distribution of transcription 
factor binding sites and of expression levels of a gene in a single ... 

11 PAC-Bayesian model averaging Q 
David A. McAllester 

July 1999 Proceedings of the twelfth annual conference on Computational learning 
theory 

Publisher: ACM Press 

Full text available: pelf (744. 38 KB) Additional Information: full citation , references , citings , index terms 



12 Explaining away ambiguity: learning verb selectional preference with Bayesian Q 
networks 

Massimiliano Ciaramita, Mark Johnson 

July 2000 Proceedings of the 18th conference on Computational linguistics - Volume 
1 

Publisher: Association for Computational Linguistics 

Full text available: ^pdf(608.54 KB) Additional Information: full citation , abstract , references 

This paper presents a Bayesian model for unsupervised learning of verb selectional 
preferences. For each verb the model creates a Bayesian network whose architecture is 
determined by the lexical hierarchy of Wordnet and whose parameters are estimated from 
a list of verb-object pairs found from a corpus. "Explaining away", a well-known property 
of Bayesian networks, helps the model deal in a natural fashion with word sense 
ambiguity in the training data. On a word sense disambiguation test our mo ... 

13 Some PAC-Bavesian theorems Q 

# David A. McAllester 
July 1998 Proceedings of the eleventh annual conference on Computational learning 

theory 
Publisher: ACM Press 

Full text available: ^ pdf(624.84 KB) Additional Information: full citation , references , citings , index terms 



14 Content 4: image analysis and retrieval: (Revolutionary feature synthesized EM Q 

A algorithm for image retrieval 
^ Rui Li, Bir Bhanu, Anlei Dong 

November 2005 Proceedings of the 13th annual ACM international conference on 
Multimedia MULTIMEDIA 05 

Publisher: ACM Press 

Full text available: pdf(709.14 KB) Additional Information: full citation , abstract , references , index terms 

As a commonly used unsupervised learning algorithm in Content-Based Image Retrieval 
(CBIR), Expectation-Maximization (EM) algorithm has several limitations, especially in 
high dimensional feature spaces where the data are limited and the computational cost 
varies exponentially with the number of feature dimensions. Moreover, the convergence is 
guaranteed only at a local maximum. In this paper, we propose a unified framework of a 
novel learning approach, namely Revolutionary Fea ... 

Keywords: revolutionary feature synthesis,, content-based image retrieval, expectation 
maximization algorithm, genetic programming, semi-supervised learning 
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15 A PAC analysis of a Bayesian estimator Q 




John Shawe-Taylor, Robert C. Williamson 
July 1997 Proceedings of the tenth annual conference on Computational learning 



theory 

Publisher: ACM Press 

Full text available: ^ pdfd.25 MB) Additional Information: full citation , references , citings , index terms 



16 Filtering: Bayesian online classifiers for text classification and filtering 
Kian Ming Adam Chai, Hai Leong Chieu, Hwee Tou Ng 

August 2002 Proceedings of the 25th annual international ACM SIGIR conference on 

Research and development in information retrieval 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citings , index 



Full text available: ■ 

t-*" terms 

This paper explores the use of Bayesian online classifiers to classify text documents. 
Empirical results indicate that these classifiers are comparable with the best text 
classification systems. Furthermore, the online approach offers the advantage of 
continuous learning in the batch-adaptive text filtering task. 

Keywords: bayesian, machine learning, online, text classification,. text filtering 



17 The Bayesian backfitting relevance vector machine H 
M> Aaron D'Souza, Sethu Vijayakumar, Stefan Schaal 

™ July 2004 Proceedings of the twenty-first international conference on Machine 
learning ICML '04 

Publisher: ACM Press 

Full text available: *g| pdf(343.49 KB) Additional Information: full citation , abstract , references 

Traditional non-parametric statistical learning techniques are often computationally 
attractive, but lack the same generalization and model selection abilities as state-of-the- 
art Bayesian algorithms which, however, are usually computationally prohibitive. This 
paper makes several important contributions that allow Bayesian learning to scale to more 
complex, real-world learning scenarios. Firstly, we show that backfitting — a traditional 
non-parametric, yet highly efficient regression t ... 

Content-based filtering & collaborative filtering: Using bayesian priors to combine Q 

classifiers for adaptive filtering 
Yi Zhang 

July 2004 Proceedings of the 27th annual international ACM SIGIR conference on 
Research and development in information retrieval SIGIR '04 

Publisher: ACM Press 

Full text available: pdf( 179.68 KB) Additional Information: full citation , abstract , references , index terms 

An adaptive information filtering system monitors a document stream to identify the 
documents that match information needs specified by user profiles. As the system filters, 
it also refines its knowledge about the user's information needs based on long-term 
observations of the document stream and periodic feedback(training data) from the user. 
Low variance profile learning algorithms, such as Rocchio, work well at the early stage of 
filtering when the system has very few training data. Low bias ... 

Keywords: bias variance, information filtering, logistic regression 
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19 Learning Bayesian network classifiers bv maximizing conditional likelihood Q 
& Daniel Grossman, Pedro Domingos 

v July 2004 Proceedings of the twenty-first international conference on Machine 
learning ICML '04 

Publisher: ACM Press 

Full text available: ^) pdf(187.23 KB) Additional Information: full citation , abstract , references 

Bayesian networks are a powerful probabilistic representation, and their use for 
classification has received considerable attention. However, they tend to perform poorly 
when learned in the standard way. This is attributable to a mismatch between the 
objective function used (likelihood or a function thereof) and the goal of classification 
(maximizing accuracy or conditional likelihood). Unfortunately, the computational cost of 
optimizing structure and parameters for conditional likelihood is pro ... 



20 Modeling dependencies in protein-DNA binding sites Q 
f $k Yoseph Barash, Gal Elidan, Nir Friedman, Tommy Kaplan 

April 2003 Proceedings of the seventh annual international conference on Research 

in computational molecular biology RECOMB '03 
Publisher: ACM Press 

Full text available' odf(41 1 94 KB) Additional Information: full citation , abstract , references , citings , index 

: terms 

The availability of whole genome sequences and high-throughput genomic assays opens 
the door for in silico analysis of transcription regulation. This includes methods for 
discovering and characterizing the binding sites of DNA-binding proteins, such as 
transcription factors. A common representation of transcription factor binding sites is a 
position specific score matrix (PSSM). This representation makes the strong assumption 
that binding site positions are independent of each othe ... 

Keywords: DNA sequence motifs, bayesian networks, factors binding sites, transcription 
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ps] Bayesian training of backpropagation networks by the hybrid Monte Carlo method - 
group of 8 » 

RM Neal - Dept. of Computer Science, University of Toronto, Tech. Rep. 1992 - funet.fi 
... 10 April 1992 Abstract. It is shown that Bayesian training of backpropagation neural 
networks can feasibly be performed by the \Hybrid Monte Carlo" method. ... 
Cited by 40 - View as HTML - Web Search 



[ps] Gaussian Processes for Bayesian Classification via Hybrid Monte Carlo - group of 4 » 
D Barber, CKI Williams - NIPS, 1996 - dai.ed.ac.uk 

... Classication via Hybrid Monte Carlo ... To make predictions based on a set of training 
data, fundamentally ... In the Bayesian approach to neural networks a prior on ... 
Cited by 34 - View as HTML - Web Search - BL Direct 



Probabilistic inference using Markov chain Monte Carlo methods - group of 25 » 
RM Neal - University of Toronto Technical Report CRG-TR-93-1 , 1993 - laplace.compbio.ucsf.edu 
... hybrid Monte Carlo method of Duane, ... In the Boltzmann machine, the Gibbs sampling 
procedure is used to make ... True Bayesian approaches to learning in an arti cial ... 
Cited bv 360 - View as HTML - Web Search 



Monte Carlo Implementation of Gaussian Process Models for Bayesian Regression and 
Classification - group of 7 » 

RM Neal - Arxiv preprint physics/9701026, 1997 - arxiv.org 

... for Bayesian Regression and Classification ... 1984). In a fully Bayesian approach, 
the hyperparameters are given prior distributions. ... 
Cited by 64 - View as HTML - Web Search 



[ps] Gaussian Processes for Regression - group of 7 » 

CKI Williams, CE Rasmussen - NIPS, 1995 -gatsby.ucl.ac.uk 

... for is possible, to recognize the connections to ARD and to use the Hybrid Monte 

Carlo method in the Bayesian treatment (see section 3). 3 TRAINING A GAUSSIAN ... 

Cited by 1 82 - View as HTML - Web Search - BL Direct 
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RM Neal - 1996 - books.google.com 

... 95 3.5.3 Hybrid Monte Carlo with persistent momentum ... 4.1 Network architectures, priors, 
and training procedures ... 4.4 Tests of Bayesian models on real data sets ... 
Cited bv 531 - Web Search - Library Search - BL Direct 
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... Pattern Analysis and Machine Intelligence, 20(12) 1342-1351, (1998). ... one risks 

\overtting" the training data. This motivates a Bayesian treatment ... 

Cited by 96 - View as HTML - Web Search - BL Direct 
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... we formulate the problem of training neural networks ... of many successful sequential 
Bayesian tools, such ... is the basis of the hybrid Monte Carlo algorithm (Brass ... 
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... In order to allow input relevance determination by Bayesian model selection, the ... 
In this case training is carried out using the likelihood of observing inputs ... 
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